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Abstract 

A temporal reasoning problem can often be naturally char- 
acterized as a collection of constraints with associated local 
preferences for times that make up the admissible values for 
those constraints. Globally preferred solutions to such prob- 
lems emerge as a result of well-defined operations that com- 
pose and order temporal assignments. The overall objective 
of this work is a characterization of different notions of global 
preference, and to identify tractable sub-classes of temporal 
reasoning problems incorporating these notions. This paper 
extends previous results by refining the class of useful notions 
of global temporal preference that are associated with prob- 
lems that admit of tractable solution techniques. This paper 
also answers the hitherto open question of whether problems 
that seek solutions that are globally preferred from a utilitar- 
ian criterion for global preference can be found tractably. 

Introduction 

Many temporal reasoning problems can be naturally char- 
acterized as collections of constraints with associated local - 
preferences for times that make up the admissible values for 
those constraints. For example, one class of vehicle routing 
problems (Toth and Vigo 2001) consists of constraints on re- 
quested service pick-up or delivery that allow flexibility in 
temporal assignments around a specified fixed time; solu- 
tions with assignments that deviate from this time are con- 
sidered feasible, but may incur a penalty. Similarly, dynamic 
scheduling problems (ElSakkout98 et at. 1998), whose con- 
straints may change over time, thus potentially requiring so- 
lution revision, often induce preferences for revised solu- 
tions that deviate minimally from the original schedule. 

To effectively solve such problems, it is necessary to be 
able to order the space of assignments to times based on 
some notion of global preference, and to have a mechanism 
to guide the search for solutions that are globally preferred. 
Such a framework arises as a simple generalization of the 
Simple Temporal Problem (STP) (Dechter et al 1991), in 
which temporal constraints are associated with a local pref- 
erence function that maps admissible times into values; the 
result is called Simple Temporal Problem with Preferences 
( STPP ) (Khatib et al 2001). Globally optimal solutions 
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to STPPs emerge as a result of well-defined operations that 
compose and order partial solutions. 

Different concepts of composition and comparison result 
in different characterizations of global optimality. Past work 
has introduced three notions of global preference: Weakest 
Link (maximize the least preferred time), Pareto, and Utili- 
tarian. Much of the work to date has been motivated by the 
overall goal of finding tractable solutions to temporal opti- 
mization problems with realistic global preference criteria. 
In particular, at NASA we are motivated to create systems 
that will automatically find optimally prefered solutions to 
problems in the rover planning domain (Bresina et al 1997), 
where the goal is to devise plans for visiting a number of sci- 
entifically promising science targets. 

In addition to reviewing the STPP framework (section 2), 
this paper extends previous results motivated by the overall 
goal of identifying useful notions of global preference that 
correspond to problems that can be solved tractably. First, 
we introduce a new category of global optimality called 
stratified egalitarian optimality, and prove that it precisely 
characterizes the subset of Pareto optimal solutions returned 
by a tractable technique introduced previously (section 3). 
Second, we provide an affirmative answer to the question of 
whether utilitarian optimal solutions to temporal reasoning 
problems can be also found tractably within this framework 
(section 4). This paper closes with a discussion of results of 
preliminary experiments and future work. 

Simple Temporal Problems with Preferences 

This section is a summary of previous work on optimiz- 
ing temporal preferences. A temporal constraint depicts re- 
strictions on the distance between arbitrary pairs of distinct 
events. In (Khatib et al 2001), a soft temporal constraint 
between events i and j is defined as a pair (/, A?)* where I 
is a set of intervals {[a, b], a < b} and fij is a local prefer- 
ence function from (J I to a set A of admissible preference 
values. For the purposes of this paper, we assume the val- 
ues in A are totally ordered, and that A contains designated 
values for minimum and maximum preference. 

When I is a single interval, a set of soft constraints de- 
fines a Simple Temporal Problem with Preferences (STPP), 
a generalization of Simple Temporal Problems [Dechter, et 
al , 1991]. An STPP can be depicted as a pair (U, C) where 
V is a set of variables standing for temporal distances, and 



preference 



Figure 1: The STPP for a Rover Science Planning Problem 
(T is any timepoint) 


C = {([Qijibij],fij)} is a set of soft constraints defined 
over V . An STPP, like an STP, can be organized as a network 
of variables representing events, and links labeled with con- 
straint information. A solution to an STPP is a complete as- 
signment to all the variables that satisfies the temporal con- 
straints. 

We define a preference vector of all the local preference 
values associated with a set F = {fij} of local preference 
functions and a solution S. Formally, let fij(S) refer to 
the preference value assigned by fij to the temporal value 
that S assigns to the distance between events i and j 9 and 
Uf{S) — (/i2(5),/i 3 (5).. • fn-in(S)) be the preference 
vector associated with F and S. In what follows the context 
will permit writing Us instead of U F ( S ) without ambiguity, 
and Ug will refer to the k th preference value of Us- 

For an example of an STPP, consider a simple Mars rover 
planning problem, illustrated in Figure 1. The rover has a 
sensing instrument and a CPU. There are two sensing events, 
of durations 3 time units and 1 time unit (indicated in the 
figure by the pairs of nodes labeled ins® , ins* and ins^ , ins^ 
respectively). The event T depicts a reference time point 
(sometimes referred to as “the beginning of time”) that al- 
lows for constraints to be specified on the start times for 
events. There is a hard temporal constraint that the CPU 
be on while the instrument is on, as well as a soft constraint 
that the CPU should be on as little as possible, to conserve 
power. This constraint is expressed in the STPP as a func- 
tion from temporal values indicating the duration that the 
CPU is on, to preference values. For simplicity, we assume 
that the preference function min on the CPU duration con- 
straints is the negated identity function; i.e., minij (t ) = -t; 
thus higher preference values, i.e. shorter durations, are pre- 
ferred. 

A solution to an STPP has a global preference value , ob- 
tained by combining the local preference values using oper- 
ations for composition and comparison. Optimal solutions 
to a STPP are those solutions which have the best prefer- 
ence value in terms of the ordering induced by the selected 
comparison operator. Solving STPPs for globally preferred 


assignments has been shown to be tractable, under certain 
assumptions about the “shape” of the local preference func- 
tions and about the operations used to compose and com- 
pare solutions. First, consider a class of local preference 
functions that includes any function such that if one draws a 
horizontal line anywhere in the Cartesian plane of the graph 
of the function, the set of X such that f(X) is not below the 
line forms an interval. This class of semi-convex functions 
includes linear, convex, and also some step functions. 

Second, consider an operator for composing solutions 
which is based on the minimal value of the component solu- 
tions, and when the operator for comparing solutions returns 
the maximum of the two solutions. Globally preffered solu- 
tions will be called Weakest Link Optimal (WLO), and are 
comprised of solutions that are maximal with respect to the 
least preferred local value. More precisely, given prefer- 
ence vectors Us and Us' corresponding to distinct solutions 
5 and S', we will say that S is WLO-preferred to S', or S' 
is WLO-dominated by S, if min(Us’) < rnin(Us ), where 
min(U) returns the minimal value in a vector U. WLO- 
optimal solutions are those to which no other solutions are 
WLO-preferred. 

STPPs with semi-convex preference functions for WLO- 
optimal solutions can be solved tractably by a process called 
the chop method. This method is based on the act of “chop- 
ping” a preference function (Figure 2). Semi-convexity im- 
plies that the set of times for which the preference function 
returns a value above a selected chop point forms a convex 
interval; call this interval the chop-induced constraint. For 
a set of preference functions in an STPP, chopping all of 
them at the same preference value induces a Simple Tem- 
poral Problem, namely, of finding a set of assignments that 
satisfies all the chop-induced constraints. A binary search 
will return the largest preference value v opt for which a so- 
lution to the induced STP exists; it can been shown that the 
solutions at v opt are WLO-optimal. 

Because the chop method returns the solution to an STP, 
its output is a flexible temporal plan , i.e., a set of solutions 
that have the same WLO-optimal value. Flexibility is of- 
ten considered important in ensuring robustness in an execu- 
tion environment that is uncertain (Muscettola Ct ah 1998). 
Nonetheless, the WLO criterion for globally preferred solu- 
tions has the disadvantage of being “myopic”, in the sense 





that it bases its evaluation on a single value. This feature 
can be shown to limit its usefulness in solving real tempo- 
ral planning problems. The rover example in Figure 1 can 
be used to illustrate this myopia. Because the CPU must be 
on at least as long as the sensing events, any globally pre- 
ferred solution using WLO has preference value -3. The set 
of solutions that have the WLO-optimal value includes so- 
lutions in which the CPU duration for the second sensing 
event varies from 1 to 3 time units (again, since WLO bases 
its evaluation solely on the least preferrred value). The fact 
that WLO is unable to discriminate between the global val- 
ues of these solutions, despite the fact that the one with 1 
time unit is obviously preferable to the others, can be clearly 
viewed as a limitation. 

Less myopic global preference criteria can be defined, but 
one issue of interest is whether tractable methods exist for 
solving STPPs based on them. For example, we can say 
that S’ Pa re to - dom i nates S if for each j, U 3 S < U 3 S , and for 
some k, Ug < Ug, . The Pareto optimal set of solutions is 
the set of non-Pareto-dominated solutions. Similarly, we can 
say that S’ Utilitarian-dominates S if Yj Ug < Yj Ug, y 
and the Utilitarian optimal set of solutions is the set of non- 
Utilitarian-dominated solutions. 

In a previous result (Khatib et al 2003), it was shown 
that a restricted form of Pareto-optimality can be achieved 
by an iterative application of the chop method. The intuition 
is that if a constraint solver could “ignore” the weakest link 
values (i.e. the values that determined the global solution 
evaluation) then it could eventually recognize solutions that 
dominate others in the Pareto sense. The links to be ignored 
are called weakest link constraints: formally, they comprise 
any link in which the optimal value for the preference func- 
tion associated with the constraint is the same as the WLO 
value for the global solution. Formalizing the process of “ig- 
noring” weakest link values is a two-step process of commit- 
ting the flexible solution to consist of the interval of optimal 
temporal values, and reinforcing this commitment by reset- 
ting their preferences to a single, “best” value. Formally, the 
process consists of: 

• Squeezing the temporal domain to include all and only 
those values which are WLO-optimally preferred; and 

• Replacing the preference function to one that assigns the 
highest (most preferred) value to each element in the new 
domain. 

The first step ensures that only the best temporal values are 
part of any solution, and the second step allows WLO to be 
re-applied to eliminate Pareto-dominated solutions from the 
remaining solution space. The resulting algorithm, called 
WLO+ returns, in polynomial time, a Simple Temporal 
Problem (STP) whose solutions are a nonempty subset of 
the WLO-optimal, Pareto-optimal solutions to a STPP. The 
algorithm WLO+, reproduced in Figure 3 for completeness, 
returns a Simple Temporal Problem (STP) whose solutions 
are precisely the WLO-optimal, Pareto-optimal solutions to 
the original STPP, P . Where C is a set of soft constraints, 
the STPP (V, Cp) is solved (step 3) using the chop approach. 

In step 5, we depict the soft constraint that results from 


Inputs: an STPP P = (V, C) 

Output: 

A STP (V, Cp) whose solutions are Pareto optimal for P. 

(1) C P = C 

(2) while there are weakest link soft constraints in Cp do 

(3) Solve (V, C P ) 

(4) Delete all weakest link soft constraints from Cp 

(5) For each deleted constraint ([a, b],f); 

(fi) add {[flopt j bopt\i fbest ) to Cp 

(7) Return ( V , Cp) 


Figure 3 : STPP solver WLO+ returns a solution in the Pareto 
optimal set of solutions 


the two-step process described above as {[a opt , b opt ], fbest), 
where [a opt , b opt ] is the interval of temporal values that are 
optimally preferred, and fbest is the preference function that 
returns the most preferred preference value for any input 
value. Notice that the run time of WLO- b is 0{\C\) times 
the time it takes to execute Solve(V , Cp ), which is a poly- 
nomial. 

WLO+, applied to the rover example in Figure 1, finds a 
Pareto optimal solution in two iterations. In the first itera- 
tion, the weakest link is that between the start and end of 
the first CPU event. WLO+ deletes this link and replaces 
it with one with the interval [3, 3] and the local preference 
function fbest • This new STPP is then solved on the second 
iteration, whereby the WLO-optimal solution with the CPU 
duration of 1 is generated. The solution to this STPP is a 
Pareto-optimal solution to the original problem. 

WLO+ was a positive result in the search for tractable 
methods for finding globally preferred solutions based 
on less myopic criteria for global preference than WLO- 
optimality. We now proceed to refine and expand these re- 
sults in two ways: first by offering a more concise charac- 
terization of the class of solution returned by WLO+, and 
secondly, by showing how restricted classes of STPP with 
a Utilitarian criterion for global preference can be solved 
tractably. 

WLO+ and Stratified Egalitarianism 

As noted in the previous section, the minimal network re- 
turned by running WLO+ on a STPP is a subset of the set of 
Pareto Optimal Solutions for that problem. In this section, 
we present a concise description of this set. 

We introduce a concept of global preference called Strati- 
fied Egalitarianism (SE). Consider again two preference vec- 
tors Us and Us f associated with solutions S and S’. We will 
say S' SE-dominates S at preference level x if: 

• U' s < x implies Ug, >U' S . 

• There exists an i such that Ug < x and Ug, > Ug. 

•U' s > x implies Ug, > x. 

We say that S' SE-dominates S (without further qualifi- 
cation) if there is any level x such that S' dominates S at x. 



It is not hard to see that the SE-dominance relation is anti- 
symmetric and transitive 1 , thus inducing a partial ordering 
of solutions. A solution S' will be said to be SE-optimal if 
it is not SE-dominated. Note that if a solution S' Pareto- 
dominates S, then S' SE-dominates S at the “highest” level 
of the S' vector. Thus, SE-optimality implies Pareto opti- 
mality. Furthermore, if S' is superior to S in the WLO or- 
dering, then S' SE-dominates S at the “lowest” level of the 
S f vector. Thus, SE-optimality also implies WLO optimal- 
ity. 

Using an economic metaphor to ground intuition, x rep- 
resents a sort of poverty line , and a “policy” S' has a better 
overall quality than S if some members below the poverty 
line in S are improved in S', even if some of those above the 
poverty line in S are made worse off in S' (as long as they 
do not drop below the poverty line). This metaphor suggests 
that SE-optimality could be a reasonable criterion for spec- 
ifying globally prefered solutions. We now prove that the 
WLO+ algorithm finds exactly the SE-optimal solutions. 

Theorem 1 The set of solutions returned by WLO+ is pre- 
cisely the set of SE-optimal solutions. 

Proof: 

Consider a solution S not returned by WLO+, i.e., one 
that is eliminated at some iteration of the WLO+ algorithm; 
let the optimal value (i.e., value of the weakest link) of the 
set of solutions be v at that iteration. Let S' be any survivor 
at that iteration. There must be some link i such that Ug < v 
(otherwise 5 wouldn’t be eliminated). But Ug, > v since S' 
survives. Thus, Ug, > Ug . Note also that U 3 S , > v for all 
links j. Thus, for any value k such that Ug < v, we have 
Ug, > Ug.lt follows that S is dominated at stratum v. 

Conversely, suppose S is dominated at some stratum v 
but, for the sake of contradiction, suppose S is not excluded 
from the set of solutions returned by WLO+. From the as- 
sumption that S is dominated at stratum v , there exists an 
S' and i such that v > U l s and Ug , > Ug , and for any j, 
U J S / < v implies U 3 S , > Ug. During the execution of the 
WLO+ algorithm, an increasing sequence of preference val- 
ues V = v \ , u 2 , . . . , vn = 1 (where 1 is the “best” prefer- 
ence value) is created, representing the WLO optimal values 
at each iteration. Clearly, Ug < 1 (where 1 is the “best” 
preference value), so one of the Us must exceed Ug. Sup- 
pose vk is the smallest element in V such that vk > Ug. 
Note that 5 would be removed at this iteration, as a result 
of its being not WLO optimal, unless the preference func- 
tion for link i had been reset at an iteration J < K. But 
that function would get reset only if i was a weakest link at 
J. Then vj < Ug since J < K, and vk is the smallest V 
such that vk > Ug. Note however, that for all links j, either 
U > v > vj or Ug, > Ug. Thus, S' would have survived 
to this iteration if S had. However, Ug, > Ug > vj, which 
contradicts the fact that i is a weakest link. □ 


Recall that we assumed the preference values to be totally or- 
dered. 



Utilitarian Optimality 

Perhaps the most natural criterion for global optimality is 
utilitarian , where the global value of a solution is the sum of 
the local values. In this section, we consider applying a util- 
itarian global optimality concept to the temporal preference 
problem. We show that finding a utilitarian optimal solu- 
tion is tractable in the case where all the preference func- 
tions are convex and piecewise linear. Piecewise linear pref- 
erence functions characterize soft constraints in many real 
scheduling problems; for example, in vehicle routing (where 
the best solutions are close to desired start times) and in dy- 
namic rescheduling (where the goal is to find solutions that 
minimally perturb the original schedule). 

Consider an STPP with preferences F = {fij}, and as- 
sume that the goal is to find a utilitarian optimal solution S, 
i.e. where J2ij fij(S) is optimal. Suppose each fij is con- 
vex and piecewise linear. Thus, there is a sequence of inter- 
secting line segments that make up fij- We will denote the 
individual linear functions corresponding to the segments by 
fij , f?j , • * * , f ™' j , as illustrated in Figure 4. 

In this case, we will show that the utilitarian optimization 
problem can be reduced to a Linear Programming Problem 
(LPP), which is known to be solvable in polynomial time 
by Karmarkar’s Algorithm (Cormen et al 1990). This re- 
sult generalizes the observation in (Khatib et al. 2001) that 
STPPs with linear preference functions can be mapped into 
an LPP. 

Since the /’ s are convex, notice that y < fij(x) if and 
only ify< /£•(*) A y < /?-(x) A . . . y < f™ i} (x). (See 
Figure 4.) For the LPP, we introduce an auxiliary variable 
Zij, together with rriij additional linear constraints of the 
form 

Za < f&(S) 

for each fij. We also introduce a set of variables X = 
{X 1 ,X 2 , . . . X n ] where X t and X 3 represent, respectively, 
the start and end points fij . An interval \pi 3 , qi 3 ] denotes the 
domain of fij. 

The complete LPP can now be formulated as follows. 
The indices are assumed to range over their available val- 
ues, which should be clear from the above discussion. Note 


that ij in {f tJ } and {Zij } range over the edges associated 
with preferences. This could be a small subset of the entire 
edges in real applications. Finally, we introduce a variable 
Sij for each temporal distance assignment in a solution. 

• Variables: {X,}, {S^}, and {Zij). 

• Constraints (conjunctive over all values of the indices): 

1. Sij = Xj - Xi 

2- Pij £ Sij £ Qij 
3. < fij(S) 

• Objective Function: J2ij Zij 

Theorem 2 The solution to the LPP as formulated above 
provides a utilitarian optimal solution to the STPP. 

Proof: 

Consider the candidate STPP solution 5 obtained from 
the values of the {Xj} variables in an optimal solution of 
the LPP. Clearly, the constraints (items 1 and 2) guarantee 
that 5 satisfies the STP underlying the STPP It only re- 
mains to show that it is optimal in the utilitarian ordering 
for the STPP. From the constraints in item 3, we see that 
Zij < fij(S) for each ij and hence Z i3 < f tJ {S). We claim 
that Z^ = fij (S ) . To see this, note that the Z^ variables can 
be varied independently without affecting the constraints in 
items 1 and 2. If Z^ < fij(S), then the objective func- 
tion can be increased, without violating any constraints, by 
increasing Zij to fij(S) y which contradicts the assumption 
that the solution is already optimal. Thus, Z tj = fij(S) for 
each ij, and so ^ Z {j = fij(S) 

Suppose now there was a better solution S' for the STPP 
in terms of the utilitarian ordering. Then fij (S') > 
J2ij fij(S) — J2ij Zij . Observe that we can now formu- 
late a better solution to the LPP based on S' (where we set 
Z[- — fij{S')) 9 which is a contradiction. Thus, the S ob- 
tained from the LPP is also optimal for the STPP. □ 

Example The rover STPP, shown in Figure 1, maps into 
the following LPP: 

• Variables 

{ cpu { , cpul, ins{, ins\ , cpu 2 , C P U 2 ^ 

{d C p-Uj ,cpuj > d C p U *j ns * , . . . }» and {zi,z 2 } 

• Constraints 

ins{ — T — 2; insl —T = 9; 

cpul ~ C P U l — 0; i ns i ~ C P U i > 

cpu\ — ins\ > 0; ins\ — ins[ = 3; 
cpul ~ C P U 2 > 0; ins s 2 — cpul > 0; 

cpul ~ i n$ 2 > 0; insl — ins s 2 = 1; 

Z 1 S: “^cpu^,cpu* 5 z 2 S: i 

• Objective 

maximize(z\ + z 2 ) 

The rover example introduces only linear preference con- 
straints, a special case of piece- wise linear functions. Notice 
that the Zi stand for preference values, defined as the nega- 
tion of the duration assigned to the link. 


Discussion and Conclusion 

A potential for tradeoff emerges in the fact that WLO+ 
generates flexible temporal plans that are guaranteed SE- 
optimal but not necessarily Utilitarian optimal with respect 
to preferences, whereas an LPP solver of the same problem 
generates a fixed plan that is guaranteed to be utilitarian op- 
timal. But if WLOi- tends to generate solutions that are al- 
ways either utilitarian or “nearly” utilitarian optimal, then it 
may be considered a better choice for solving many prob- 
lems due to its robustness in generating flexible plans. Ex- 
periments are currently underway evaluating the utilitarian 
quality of WLO+ solutions over a diverse set of problems, 
as well as to compare the run-time performance of the two 
formulations. The results of these experiments are too pre- 
liminary to summarize here, and will be the subject of a fu- 
ture report. 

Another long range goal of this work is to integrate prob- 
abilities with preferences in temporal constraint reasoning, 
incorporating the results of (Tsamardinos et ah 2003). This 
integration will allow potentially for a formulation of the ex- 
pected utility paradigm of classical decision theory (Savage 
1954) within a constraint-based setting. 

The work reported here contributes to the overall goal of 
increasing the adeptness of automated systems for planning 
and scheduling. The objectives of this work overlap with 
those of a number of diverse research efforts. First, this 
work offers an alternative approach for reasoning about pref- 
erences to approaches based on multi-objective decision the- 
ory (Bacchus and Grove 1995). This work also contributes 
to, and builds upon, the on-going effort to extend CSP algo- 
rithms and representations to solve optimization problems 
or problems where knowledge is uncertain (for example, 
(Dubois et al 1996)). Finally, the focus on solving prob- 
lems involving linear piece- wise constraints Jias similarities 
to other efforts more grounded in Operations Research (for 
example, (Ajili etal 2003)). 
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